BIOTECHNOLOGY M$\ ga-S 



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 

Pcr/o 1 



Source: 

Date Processed by STIC: 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mai! help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTOV 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence' 
listings, thus saving time and money. 

Checker Version 3.0 can be down loaded from the USPTO website at the following address: 

http://www.uspto,gov/web/offices/pac/checker 
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18 


<151> 


20 


<150> 


21 


<151> 


23 


<160> 


25 


<170> 



RAW SEQUENCE LISTING DATE: 
PATENT APPLICATION: US/09/936 f 883 TIME: 

Input Set : A:\seqlist.txt 

Output Set: N:\CRF3\10042001\I936883.raw 



APPLICANT: Miyata, Toshio 
TITLE OF INVENTION: A METHOD 
AND USE THEREOF 
FILE REFERENCE: SHIM012 

CURRENT APPLICATION NUMBER: US/09/936 , 883 
CURRENT FILING DATE: 2001-09-07 

"PRIOR APPLICATION NUMBER: PCT/JP00/0164 6 
PRIOR FILING DATE: 2000-03-17 
PRIOR APPLICATION NUMBER: 11/75305 
PRIOR FILING DATE: 1999-03-19 
PRIOR APPLICATION NUMBER: 11/306623 
PRIOR. FILING DATE: 1999-10-28 
NUMBER OF SEQ ID NOS : 21 

SOFTWARE: FastSEQ for Windows Version 4.0 



10/04/2001 
10:11:25 



Does Not Comply 
Corrected Diskette Needed 



FOR DETECTING MEGSIN PROTEIN 




ERRORED SEQUENCES 



27 <210> SEQ ID NO: 1 

28 <211> LENGTH: 1867 

29 <212> TYPE: DNA 

30 <213> ORGANISM: Homo 
32 <400> SEQUENCE: 1 

3 3 at gqcctccc ttgctqcagc 

34 aadfrifcaguhc yshasftuhag 




sapiens 



aaatgcagag ttttgcttca acctgttcmt aasruaaaaa 60 



35/tcctctargg umtasasaSn 
39 ggccctggtc cgcttgggcg 
cctctctcag attgataagt 

8 vaasnthraa srggatatgg 

9 gytyrgyasn srsrasnsrg 
tcccacaagg attatgatct 
gaatgggctt tttgctgaaa 
yrgyhhsysg 



agagatggat gacaatcaag gaaatggaaa tgtgttcttt 120 

^jngyasngya snvabJbsrsr ctgagtietct 

ctcaagatus ruhaaaauaa uvaargugya 

tgcttcatgt taacactgcc tcaassrusr 

aaactcttct aatagtcagt cagggctcca 

nsrgyugnsr gnuaaaagag ttttttctga 

cysargvahs rasasnaasr hsysastyra 

aagtgtatgg ctttcataag srvaasngyu 



tcgct^ccc t\ 1 8 0 
agnasgactc B40 
gnasysuuhs 800 
gtctcaactg fe 60 
tataaatgca /420 
suagcattgt /4 80 
haaguysvat( 5 



42 

43 yrgucysaag 




"aoEaca^Ega 
uysutyrasa 



gtgtgccgaa 
aysvaguarg_ 



aaattaLacg 
attaacttta 



44/ agacgtaata 
4? aaaatgaaac 
sysasnvagy 
gygysrsrsr 
^caagagcgaa 



atgccaaagt 
cgaatcattt 



40 *N 

ggagcgaast 600 / 
agaagacact 660 



^CEaataagva 
acatggcaaa 
guggtggcat 
aavamtvauv 
accataaath 



ashthrasnh 
atcaagaacg 
aagctcatct 
aasnaavaty 
ysgyystrgn 



4 9 aatctcccaa 
50 aavaaamtmt 



gtgctctggg 
catcaggaac 



aaggcagtcg 
qqaagttcaa 




guargysn 
aacatgtacg 
ctctctgaaa 
naatctaatg 
hrasnrarga 
attatgaaat 



suguasthra 
tgattggtga 
gctgtaatgg 
rttcaaaggc 
sraahthrys 
ccatcfatgcy 

tttqtctqtt 



rgargasnas 
atrvaguasn 
tgctggtgaa 
aagtggcaat 
srguthrasn 



nystg^gttg 
guthrhsgyy 
tgctgtgtac 
cagccttcac 
tgccatttca 



T2XT 

780 

840 

900 

960 



shshysisrry 
attgaggacc 



asnusrvagu 
ttctgysugu 
ttgaaaacaa 
gaatggacca 
rgmtthrsry 
gaaaguvahh 



asrsrmtaag 
uargtyrasn 
actgaccttt 
atccaaggcg 
styrvagagg 
rgnhysguys 



attcttgagc 
gygyasnmtt 
cagurguasn 
aatgacctct 
tattttttcc 
asntyrgumt 



tcagaracaa 
yrvauctgcc 
asusrgugua 
aagtatgtta 
tcagttcaag 
yscaatattt 



scyssrgyys 1020 
catcaatghs 1080 
tgg tggc ata\l 140 



tgagaatgac 
snysuthrhg 
snumtgutrt 
atagagaaga 
gagagcccta 



1200 
1260 
1320 
1380 
1440 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/936,883 

Input Set : A:\seqlist.txt 

Output Set: N:\CRF3\10042001\I936883, 



DATE: 10/04/2001 
TIME: 10:11:25 



raw 



5 7 /gggctgaaag atatctttga tgaatccaaa gntvruarqa augyuysa sh asgusrysgcf^l500 

58 agatctctct gggattgctt cggggggtcg tctgtatata tcaaggaaas usrgyaasrg 1560 

59 ygyargutyr srargatgat gcacaaatct tacatagagg tcactgagga gggcaccgag 1620 ' * ^ 

60 gc tmtmthsy ssrtyrguva thrauguavt hrguaaactg ctgccacagg aagtaata tt 1 680 ^ 1 

61 (grEagaaaagc aac tccctca gtccthraaa athrgysras nvaguysgnu rgnsracggby T74 0 / 

62 gtttagagct gaccacccat tcctatttgt tatcaggaag gatthruhar gaaasnsrnu 1800 




gtttagagct gaccacccat tcctatntgt 

63 hvaargysas gacatcatct tattcagtgg 

64 asrcysr 

313 <210> SEQ ID NO: 18 
LENGTH: 1938 
TYPE ^JWA^ ) 

ORGANISM: Rattus norvegicus 
SEQUENCE: 18 



tcaggaag 
caaagtttct 



gatthruhai 
tgcccttgaa 



gaaasnsrnu 
suhsrgyysv 



1860 
1867 



314 <211> 

315 <212> 

316 <213> 
318 <400> 



E-- 
E--> 
E--> 
E--> 
E--> 
E--> 
E--> 
E--> 
E--> 
E--> 
E--> 
E--> 
E--> 
E--> 
E--> 
E--> 
E--> 
E--> 
E--> 
E--> 
E--> 
E--> 
E--> 
E--> 
E--> 
E--> 
E--> 
E--> 
E--> 
E--> 
E--> 
E--> 
E--> 



319 (£t&c 
20* ' 




aaa 



aguh gyhasttatt cagagagatg gatagtagtc aaggaaacgg aaatgtattc 
arggu mtassrsrgn gyasngyasn vahhtcttcc ctgagcatct tcactgccct 
ctsrsrusrh thraausrua rgugyaacga ggtgactgtn 
tctccarggy ascysaaarg gnasysaauh 
cagtcagcta ggactgcaat atrsrarggn 
gaaaagagtt cttgctgaca taaactcatc 
nsrsrhsysa saaaaactca gcattgccaa 
tysusraaas ngyvahaagu ysvahashca 
atacaatgct aaagtghsys srtyrmtguc 
tgattttaca aatgatatac aagaaaccag 
guthrarghy saataaatgg attgaaaatg 
snystrguas nguthrhsgy ysysysvaug 
tggtgctagt gaatgctgya ssrsrusrsr 
ggcaagtgga aatcggcctt caccaagagt 
yssrasthrc tcagttgcca tttcaggtct 
cyshshargs rrsrgyrgyy saavaasnat 
taccattcag gagccamtmt hsgnguarga 
ttgagctaca atatcatggt ggcataagca 
sgygysrmtt yratcatgtt gcccgaggat gacctatccg 
usrgugusry susrttccag aatctaatgg 
aghgnasnum tastrthras nsrargysmt 
ttcaagatag agaaagatta tgaatyrvaa 
gccacttgaa atctgtaggc ttggaagaca 
guashvagut ccagggctga tctgtctgga 
rgaaasusrg yaasrgygya rgutyrvatc 
ctcagaagaa ggcaccsrys umthsyssru 
acagaaagta acatcgttga aaagctactt 
uurgaatcca cggtgttcag agctgaccgc 
hargaaasar grhuhvaarg aagaatggca 
ctysasngyu hthrgyysva srcysrtgaa 
catgaagaaa catcataagt gaatagaatt 



60 ^ * 



321 ttc 

322 gagcctaatc cgtttgggtg 

323 nncgtcagat tgacaaggcc ctgcacttta 

324 shsrccatca agacaaggga attcatcgaa 

325 gyasnsrsra snsrgnugyu gntyrcaatt 

326 tcataaggat nnngnuysar gvauaaasas 

327 tggagttttt gcagagaaag tatttgattt 

328 taagagctat atggagtgtg ctgaaaactt 

329 ysaaguasnu tyrasnaays vagaaagagt 

330 atttaaaatt guargvaash thrasnasgn 

331 aaacacatgg caaaatcaag aaggtgttga 

332 gggacagcag cctcagctca tcagctgtca 

333 sraavamtva uvaasnaagt ttacttcaaa 

334 jataccvaty rhysgyystr yssraahthr 
3 35 ^ccagcggtc ctggaaaagc 

336 gatgcatcaa gaacggaggt 

337 rghasnusrt hrgngurcca 

338 tgtacrmtgn uguugntyrh 

339 aaattgaaag caagctgagt mturguasas 
34 0 actggacaaa tagcaggaag atgaaatctc 

341 yssrgntatg tgaatgtgtt tctcccccag 

342 snvahurgnh ysguysasty rguatgagga 
34 3 tctttgttga gmtargsrjis uyssrvagyu 

344 attgcctctg gaggtcgtct ctatgtasra 

345 aaagctaatg cacaagtccc tcatagaggt 

346 guvasrgugu gythrgaggc 
347/ cctguaathr aaaathrgus 
34 8 ccctttctgt ttgtcattag 
344 tcatcttatt tactggcaaa 
35D attctatttg gttttccata 
35jb-XK fcaattggaa ytacatgg 

353 <210> SEQ ID NO: 19 

354 <211> LENGTH: 380 

355 <212> TYPE: PRT 



agttaatusr 
tcaatttgtc 
atgcagattc 



aactgctgcc 
rasnvaguys 
ggusrthrva 
gtctcgtgtc 
cactaacagg 



120 
180 
240 
300 
360 
120 
180 
540 
600 
660 
720 
780 
840 
900 
960 
L020 
L080 
L140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1938 
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RAW SEQUENCE LISTING DATE: 10/04/2001 

PATENT APPLICATION: US/09/936,883 TIME: 10:11:25 

Input Set : A:\seqlist.txt 

Output Set: N:\CRF3\10042001\I936883.raw 



356 


<213> ORGANISM: 


Rattus i 


norvegicus 
















358 


<400> SEQUENCE : 


19 
























359 


Met 


Ala 


Ser 


Leu 


Ala 


Ala 


Ala 


Asn 


Ala 


Glu 


Phe Gly 


Phe 


Asp 


Leu 


Phe 


360 


1 








5 










10 










15 




361 


Arg 


Glu 


Met 


Asp 


Ser 


Ser 


Gin 


Gly 


Asn 


Gly Asn Val 


Phe 


Phe 


Ser 


Ser 


362 








20 










25 










30 






363 
364 


Leu 


Ser 


He 


Phe 


Thr 


Ala 


Leu 


Ser 


Leu 


He 


Arg 


Leu 


Gly Ala 


Arg 


Gly 
































E--> 365 


Asp 


Cys( 


Xaaj 


| Arg 


Gin 


He 


Asp 


Lys 


Ala 


Leu 


His 


Phe 


He 


Ser 


Pro 


Ser 


366 




50 










55 










60 










367 


Arg 


Gin 


Gly 


Asn 


Ser 


Ser 


Asn 


Ser 


Gin 


Leu 


Gly 


Leu 


Gin 


Tyr 


Gin 


Leu 


. 368 


65 










70 










75 








V 


80 


E--> 369 


Lys 


Arg 


Val 


Leu 


Ala 


Asp 


He 


Asn 


Ser 


Ser 


His 


Lys 


Asp 


pcaa 




Leu 


370 










85 










90 








\3 


'95 




371 


Ser 


lie 


Ala 


Asn 


Gly Val 


Phe 


Ala 


Glu 


Lys 


Val 


Phe 


Asp 


Phe 


His 


Lys 


372 








100 










105 










110 






373 


Ser 


Tyr 


Met 


Glu 


Cys 


Ala 


Glu 


Asn 


Leu 


Tyr 


Asn 


Ala 


Lys 


Val 


Glu 


Arg 


374 






115 










120 










125 








375 


Val 


Asp 


Phe 


Thr 


Asn 


Asp 


He 


Gin 


Glu 


Thr 


Arg 


Phe 


Lys 


He 


Asn 


Lys 


376 




130 










135 










140 










377 


Trp 


lie 


Glu 


Asn 


Glu 


Thr 


His 


Gly 


Lys 


He 


Lys 


Lys 


Val 


Leu 


Gly 


Asp 


378 


145 










150 










155 










160 


379 


Ser 


Ser 


Leu 


Ser 


Ser 


Ser 


Ala 


Val 


Met 


Val 


Leu 


Val 


Asn 


Ala 


Val 


Tyr 


380 










165 










170 


• 








175 




381 


Phe 


Lys 


Gly 


Lys 


Trp 


Lys 


Ser 


Ala 


Phe 


Thr 


Lys 


Ser 


Asp 


Thr 


Leu 


Ser 


382 








180 










185 










190 






383 


Cys 


His 


Phe 


Arg 


Ser 


Pro 


Ser 


Gly 


Pro 


Gly 


Lys 


Ala 


Val 


Asn 


Met 


Met 


384 






195 










200 










205 








385 


His 


Gin 


Glu 


Arg 


Arg 


Phe 


Asn 


Leu 


Ser 


Thr 


He 


Gin 


Glu 


Pro 


Pro 


Met 


386 




210 










215 










220 










387 


Gin 


lie 


Leu 


Glu 


Leu 


Gin 


Tyr 


His 


Gly 


Gly 


He 


Ser 


Met 


Tyr 


He 


Met 


388 


225 










230 










235 










240 


389 


Leu 


Pro 


Glu 


Asp 


Asp 


Leu 


Ser 


Glu 


He 


Glu 


Ser 


Lys 


Leu 


Ser 


Phe 


Gin 


390 










245 










250 










255 




391 


Asn 


Leu 


Met 


Asp 


Trp 


Thr 


Asn 


Ser 


Arg 


Lys 


Met 


Lys 


Ser 


Gin 


Tyr 


Val 


392 








260 










265 










270 






393 


Asn 


Val 


Phe 


Leu 


Pro 


Gin 


Phe 


Lys 


He 


Glu 


Lys 


Asp 


Tyr 


Glu 


Met 


Arg 


394 






275 










280 










285 








395 


Ser 


His 


Leu 


Lys 


Ser 


Val 


Gly 


Leu 


Glu 


Asp 


He 


Phe 


Val 


Glu 


Ser 


Arg 


396 




290 










295 










300 










397 


Ala 


Asp 


Leu 


Ser 


Gly 


He 


Ala 


Ser 


Gly 


Gly 


Arg 


Leu 


Tyr 


Val 


Ser 


Lys 


398 


305 










310 










315 










320 


399 


Leu 


Met 


His 


Lys 


Ser 


Leu 


He 


Glu 


Val 


Ser 


Glu 


Glu 


Gly 


Thr 


Glu 


Ala 


400 










325 










330 










335 




401 


Thr 


Ala 


Ala 


Thr 


Glu 


Ser 


Asn 


He 


Val 


Glu 


L ys 


Leu 


Leu 


Pro 


Glu 


Ser 


402 








340 










345 










350 






403 


Thr 


Val 


Phe 


Arg 


Ala 


Asp 


Arg 


Pro 


Phe 


Leu 


Phe 


Val 


He 


Arg 


Lys 


Asn 


404 






355 










360 










365 








405 


Gly 


He 


He 


Leu 


Phe 


Thr 


Gly 


Lys 


Val 


Ser 


Cys 


Pro 











/UUL 7 
~^ (Pr\ 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/936, 883 



DATE: 10/04/2001 
TIME: 10:11:25 



Input Set : A:\seqlist.txt 

Output Set: N:\CRF3\10042001\I936883.raw 




370 
<210> SEQ 
<211> 
<212> 
<213> 
<400> 



. 9/2M 

guggtgctcg aggtgactgt gca cgtcaga 180 7 \ c i Q 
scysaaargg nasysaauhs haacatacca^240 S /C^*^ 



E--> 



E--> 
E--> 
E--> 

E--> 
E--> 
E--> 



375 380 

ID NO: 20 
LENGTH: 1848 
TYPE: DNA 

ORGANISM: Mus Musculus 
SEQUENCE: 20 

ttcgac ttat tcagagagat ggatagtagc caaggaaatg gaaatgtaha suharggumt 60 

assrsrg^py as^yas^a ttcttctctt ccctgagcat cttcactgcc ctgaccctaa 120 

tCCa tCtahh srsm grh1 " h rasnt-fif^r 

ttijacaaggc actgcacttt gyaaarggya 

tcaagacaag gaaactcatc taataatcag ccaggactta snrsrarggn gyasnsrsra 

snasngnrgy ucagtatcaa ttgaaaagag ttcttgctga cataaactca tctcataagg 

ntyrgnuysa rgvauaaasa snsrsrhs ys gattatgaac tcagcattgc cactggagt t 

tttacaaaaa aaaicLaias tyro uusraa rhrgyvahaa gny svatyrq antttnat.aa 

actacatt gagrgtgctg aaaacttata caatgctash hsysasntyr gucysaagua 

snutyrasna aaaagtggaa agagttgact tcacaaatga tgtacaagat accagattty 

svaguargva ashthrasna svagnasthr arghaaaatt aataaatgga ttgaaaatg 

gacacatgga aacratcaacra aavsasnyst rguasn guth rhsov vsvsv satqttggg 

gacagcagcc tcagctcgtc ggctgtcatg gtgct ggtgv aj ^gy^cTrq-r n firfir.q^aa v 

4 28 fSntvauvaaa cgctgt1:tac LLuadaqqca a atcrcraaatc ggccttcacc aagactasna 

429 avatyrhysg yystryssra ahthrysthr gataccctca gttgccgttt taggtctccc 

4 30 acgtg tcctg gaaaagtaas thrusrcysa rghargsrrt hrcysrg yys vagttaa tat 

4 3 l/%atgca Leaa yaa cgg cggt tcaatttgto — fca^ea^cag-va^fifivEmth sgnguar 

432 ghasnusrth rgncagccac caatgcaggt tcttgagctc caatatcatg gtggcataag _ . _ 

4 3 3s^cgnr rmtgnv auguugntyr hsgygysrat gtacattatg ctgcctg agg atggcc ta£g^ll40 
434 tgaaHtr 
43S 




43^ 
436 
43^ 



cysgugus rysctg agtt tccagaatct 
gatggactgg accaatagga ggaaaatgaa ausrhgnasn umtastr 



1200 



gy~T 

yr A 



E--> 
E--> 



r asnargargy 

smtystctca gtatgtgaac gtgtttctcc cccagttcaa gatagagaag aatsrgntyr 

yaasnvahur gnhysguysa sntatgaaat gacgcaccac ttgaaatcr.t tag grttg^ 

438 agatatcttt tyrgumtthr hshsuyssru gyuysashga tgagtccagt gcagatctct 

4 39 ctggaattgc ctctggaggt cgtctcasgu srsraaasus rgyaasrgyg yargutacgt 

440 atcaaagcta atgcacaagt cattcataga ggtctcagag gagtyrvasr ysumthsyss 

441 r hguvasrgu guggcactga agccactgct gccacagaaa ataacattgt tgaaaagc ag 162Q_ 
4 4 ^gy thrguaa t hraaaat hrg uasna qn Y Pi fl nyn g nrttnp tg n gt .rraca_jtgtt"rg^g* a Tff^ 

443 ccgaccgccc ctttctg ttt gtcurqusrt hrvahargaa as a rgrhuhv aatcaa gaag 174 0_ 1 \) sj 

444 (e hiUja u t iLu a LcLta tTrEac tg gcaaagtc tcttgtccty sysasnasuh thrgyysv a^s^Tyoo T/CxA^^K / 

445 rcyslTgaaa ttcgatttgg tttcctatac agtaacaggc atcaagaa ~ 1848 



260 
320 
1380 

1440 
1500 
1560 





Use of n and/or Xaa has been detected in the Sequence Listing. 

Review the Sequence Listing to insure a corresponding 
explanation is presented in the <220> to <223> fields of 
each sequence using n or Xaa. 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/936, 883 



DATE: 10/04/2001 
TIME: 10:11:26 



L 
L 
L 
M 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
M 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 



:11 M:270 C 
:12 M:271 C 
:34 M:340 E 



Input Set : A:\seqlist.txt 

Output Set: N:\CRF3\10042001\I936883.raw 

Current Application Number differs, Replaced Current Application Number 
Current Filing Date differs, Replaced Current Filing Date 
(46) "n" or "Xaa" used: Feature required, for SEQ ID# : 1 



:340 Repeated in SeqNo=l 



132 M:258 W 

132 M:258 W 

132 M:341 W 

137 M:220 C 

144 M:258 W 

144 M:258 W 

144 M:341 W 

156 M:258 W 

i56 M:258 W 

156 M:341 W 

161 M:220 C 

218 M:220 C 

319 M:321 E 

320 M:340 E 
320 M:320 E 
320 M:321 E 



Mandatory Feature missing, <221> not found for SEQ ID# : 3 
Mandatory Feature missing, <222> not found for SEQ ID# : 3 
(46) "n" or "Xaa" used, for SEQ ID#:3 

Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:4 
Mandatory Feature missing, <221> not found for SEQ ID# : 4 
Mandatory Feature missing, <222> not found for SEQ ID# : 4 

46) "n" or "Xaa" used, for SEQ ID#:4 
Mandatory Feature missing, <221> not found for SEQ ID# : 5 
Mandatory Feature missing, <222> not found for SEQ ID# : 5 

46) "n" or "Xaa" used, for SEQ ID# : 5 
Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID# : 6 
Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:11 

1) "t" not allowed in RNA Sequence, NUMBER OF INVALID 't' KEYS: 15 

46) "n" or "Xaa" used: Feature required, for SEQ ID#:18 



:34 0 Repeated in 



321 M:320 E 

321 M:321 E 

322 M:320 E 

322 M:321 E 

323 M:320 E 

323 M:321 

324 M:320 

324 M:321 

325 M:320 

325 M:321 

326 M:320 

326 M:321 

327 M:320 

327 M:321 

328 M:320 

328 M:321 

329 M:320 

329 M:321 

330 M:320 E 

330 M:321 E 

331 M: 320 E 

331 M:321 E 

332 M:320 E 

332 M:321 E 

333 M:320 E 

333 M:321 E 

334 M:320 E 



Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS: 12 
"t" not allowed in RNA Sequence, NUMBER OF INVALID 't' KEYS: 27 
SeqNo-18 

Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS: 12 

"t" not allowed in RNA Sequence, NUMBER OF INVALID 't f KEYS: 39 

Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS: 11 

"t" not allowed in RNA Sequence, NUMBER OF INVALID 't» KEYS: 50 

Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS : 9 

"t" not allowed in RNA Sequence, NUMBER OF INVALID *t' KEYS: 59 

Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS : 9 

"t" not allowed in RNA Sequence, NUMBER OF INVALID *t' KEYS: 68 

Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS: 11 

"t" not allowed in RNA Sequence, NUMBER OF INVALID 't f KEYS: 79 

Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS : 6 

"t" not allowed in RNA Sequence, NUMBER OF INVALID f t' KEYS: 85 

Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS: 14 

"t" not allowed in RNA Sequence, NUMBER OF INVALID *t' KEYS: 99 

Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS: 15 

"t" not allowed in RNA Sequence, NUMBER OF INVALID »t' KEYS : 114 

Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS: 10 

"t" not allowed in RNA Sequence, NUMBER OF INVALID *t' KEYS: 124 

Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS: 12 

"t" not allowed in RNA Sequence, NUMBER OF INVALID f t' KEYS: 136 

Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS: 7 

"t" not allowed in RNA Sequence, NUMBER OF INVALID ! t' KEYS: 143 

Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS : 11 

"t" not allowed in RNA Sequence, NUMBER OF INVALID f t* KEYS: 154 

Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS: 11 

"t" not allowed in RNA Sequence, NUMBER OF INVALID ' t' KEYS: 165 

Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS: 13 
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Page 6 of 7 



VERIFICATION SUMMARY 
PATENT APPLICATION: US/09/936 , 883 

Input Set : A:\seqlist.txt 

Output Set: N:\CRF3\10042001\I936883.raw 



DATE: 10/04/2001 
TIME: 10:11:26 



L: 334 
L:335 
L: 335 
L: 336 
L: 336 
L: 337 
L: 337 
L:338 
L:338 
L:339 
L:339 
L: 340 
L: 340 
L:341 
L: 341 
L: 342 
L: 342 
L:343 
L:343 
L:344 
L:344 
L:345 
L:345 
L:346 
L: 346 
L:347 
L: 347 
L:348 
L:348 
L:349 
L:349 
L:350 
L: 350 
L:351 
L:351 
L:365 
M:340 
L:416 
M:340 



M:321 E 
M:320 E 
M:321 E 
M:320 E 
M:321 E 
M:320 E 
M:321 E 
M:320 
M: 321 
M:320 
M:321 
M:320 
M:321 E 
M:320 E 
M:321 E 
M:320 E 
M:321 
M:320 
M:321 
M:320 
M:321 E 
M:320 E 
M:321 E 
M:320 E 
M:321 E 
M:320 E 
M:321 
M:320 
M:321 
M:320 
M:321 E 
M:320 E 
M:321 E 
M:320 E 
M:321 E 
M:340 E 
Repeated 
M:340 E: 
Repeated 



(1) "t" not allowed in 
(1) Wrong Nucleic Acid 
(1) "t" not allowed in 
(1) Wrong Nucleic Acid 
(1) "t" not allowed in 
(1) Wrong Nucleic Acid 
(1) "t" not allowed in 
(1) Wrong Nucleic Acid 
(1) "t" not allowed in 
(1) Wrong Nucleic Acid 
(1) "t" not allowed in 
(1) Wrong Nucleic Acid 
(1) "t" not allowed in 
(1) Wrong Nucleic Acid 
(1) "t" not allowed in 
(1) Wrong Nucleic Acid 
(1) "t" not allowed in 
(1) Wrong Nucleic Acid 
(1) "t" not allowed in 
(1) Wrong Nucleic Acid 
(1) "t" not allowed in 
(1) Wrong Nucleic Acid 
(1) "t" not allowed in 
(1) Wrong Nucleic Acid 
(1) "t" not allowed in 
(1) Wrong Nucleic Acid 
(1) "t" not allowed in 
(1) Wrong Nucleic Acid 
(1) "t" not allowed in 
(1) Wrong Nucleic Acid 
(1) "t" not allowed in 
(1) Wrong Nucleic Acid 
(1) "t" not allowed in 
(1) Wrong Nucleic Acid 
(1) "t" not allowed in 
(46) "n" or "Xaa" used: 
in SeqNo=19 
(46) "n" or "Xaa" 
in SeqNo=20 



RNA Sequence, NUMBER OF INVALID "f KEYS: 178 
Designator, NUMBER OF INVALID KEYS : 6 
RNA Sequence, NUMBER OF INVALID ' t* KEYS: 184 
Designator, NUMBER OF INVALID KEYS: 13 
RNA Sequence, NUMBER OF INVALID 't 1 KEYS: 197 
Designator, NUMBER OF INVALID KEYS: 12 
RNA Sequence, NUMBER OF INVALID 't' KEYS: 209 
Designator, NUMBER OF INVALID KEYS: 13 
RNA Sequence, NUMBER OF INVALID 't f KEYS: 222 
Designator, NUMBER OF INVALID KEYS: 10 
RNA Sequence, NUMBER OF INVALID 't f KEYS: 232 
Designator, NUMBER OF INVALID KEYS : 9 
RNA Sequence, NUMBER OF INVALID 't f KEYS: 241 
Designator, NUMBER OF INVALID KEYS: 16 
RNA Sequence, NUMBER OF INVALID , t* KEYS: 2 57 
Designator, NUMBER OF INVALID KEYS : 9 
RNA Sequence, NUMBER OF INVALID 't f KEYS: 266 
Designator, NUMBER OF INVALID KEYS: 13 
RNA Sequence, NUMBER OF INVALID ! t' KEYS: 2 79 
Designator, NUMBER OF INVALID KEYS: 12 
RNA Sequence, NUMBER OF INVALID f t' KEYS: 291 
Designator, NUMBER OF INVALID KEYS : 8 
RNA Sequence, NUMBER OF INVALID 't' KEYS: 299 
Designator, NUMBER OF INVALID KEYS: 10 
RNA Sequence, NUMBER OF INVALID 't' KEYS: 309 
Designator, NUMBER OF INVALID KEYS : 8 
RNA Sequence, NUMBER OF INVALID 't 1 KEYS: 317 
Designator, NUMBER OF INVALID KEYS: 12 
RNA Sequence, NUMBER OF INVALID f t f KEYS: 329 
Designator, NUMBER OF INVALID KEYS: 15 
RNA Sequence, NUMBER OF INVALID ? f KEYS: 344 
Designator, NUMBER OF INVALID KEYS: 19 
RNA Sequence, NUMBER OF INVALID 't' KEYS: 363 
Designator, NUMBER OF INVALID KEYS : 5 
RNA Sequence, NUMBER OF INVALID *t f KEYS: 368 
Feature required, for SEQ ID#:19 



used: Feature required, for SEQ ID#:20 
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Raw Sequence Listing Error Summary 



0<?/<?3 £,m 



ERROR DETECTED SUGGESTED CORRECTION SERIAL NUMBER: 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWAF 



1 Wrapped Nucleics 

Wrapped Aminos 



The number/text at the end of each line "wrapped" down to the next line. This may occur if your file 
was retrieved in a word processor after creating it Please adjust your right margin to .3; this will 
prevent "wrapping.'* 



Invalid Line Length The rules require that a line not exceed 72 characters in length, this includes white spaces. 



Misaligned Amino 
Numbering 

Non-ASCII 



_ Variable Length 



The numbering under each 5* amino acid is misaligned. Do not use tab codes between numbers; 
use space characters, instead. 

The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission Is saved in ASCII text. 

Sequences) contain n's or Xaa's representing more thah, one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patentln version 2.0 has caused foe <220>-<223> section to be missing from amino acid 

sequencers) . Normally, Patemjth would* automatically, generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 

_Skipped Sequences Sequences) missing. If intentional, please insert the following lines for each skipped sequence: 

(OLD RULES) (2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where M X" is shown) 

(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: (insert SEQ*ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:*' response to include the skipped sequences. 



6 Patentln 2.0 

"bug" 



8 Skipped Sequences ' Sequences) _ 



missing. If Intentional, please insert the following lines for each skipped sequence. 



* (NEW RULES) 



Use of n's or Xaa's 
(NEW RULES) 



10 Invalid <213> 

Response 

II Use of<220> 



12 Patentln 2.0 

"bug" 



<210> sequence id number 
<400> sequence id number 
000 

Use of n's and/or Xaa's have been delected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
is Artificial Sequence 



Sequences) _ 



missing the <220> "Feature" and associated numeric identifiers and responses. 



Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or 
"Unknown." Please explain source of genetic material in <2i20> to <223> section. 
(See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



13 Misuse of-n ' n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent 

any value not specifically a nucleotide. 



AMC/MH - Biotechnology Systems Branch - 08/2 1/2001 



